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ABSTRACT 

This  paper  continues  studies  initiated  in  [l]  concerning  itera¬ 
tive  methods  of  driving  the  gradient  to  0.  In  [l],  we  were  concerned 
with  functions  which  were  twice  differentiable;  in  this  paper  only 
first  derivatives  will  be  assumed.  Other  results  included  are  a 
fixed  point  theorem  for  "gradient”  operators  and  a  simple  proof 
of  the  classical  method  of  steepest  descent. 
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Let  H  be  a  Hilbert  space,  x^  an  arbitrary  point  of  H 
and  f  a  functional  on  H.  Let  S  denote  the  level  set  of  f  at 
x  ,  viz:  S  =  fx  e  Hi  f(x)  <  f(x  )).  Let  f'(x,«)  denote  the 
Frechet  derivative  of  f  at  x,  and  7f(x)  its  representor  in  H. 

Set  u(x,p)  =  f  (x)  -  f  (x  -  pcp(x)),  g(x,p)  =  A(x,p  )/[vf  (x) ,  cp(x)]p 
and  fix  d,  0  <  d  ^  l/2.  Here  cp  denotes  a  bounded  map  from  S 
to  H,  satisfying  [Vf(x),  cp(x)]  2  0,  such  that  given  c  >0  there 
exists  6  >  0  for  which  [7f(x),  ;p(x)]  <  6  implies  |[tff(x)||  <  c. 

For  example,  see  Remark  4»  below* 

Theorem  1.  Assume  that  on  S  f  is  Frechet  differentiable  and  bounded 
below,  and  that  Vf  is  uniformly  continuous  on  S.  Set  x^+^ 

,  cp(xk)]  =  0*  Otherwise  choose  pk  so  that 
^  ^  1  "  d  when  or  Pk  =  1  when 

0(^,1)  2  o,  and  set  x^  =  -  P^^)*  Then: 

converges  to  0  while  fCx^)  converges  downward 
to  a  limit  L* 
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(b)  If  the  sequence  {x^}  has  cluster  points  then  every 
cluster  point  satisfies  7f(z)  =0*  If  cp  =  7f  has 
finitely  many  zeros  on  S  and  S  is  compact  then  (x,  ] 
converges* 

(c)  If  S  and  f  are  convex,  and  cp(x)  =  Vf(x),  then 

L  =  inf{f(x):  x  c  H}*  If  {x^}  has  weak  cluster  points 
then  all  such  minimize  f* 

Proof*  (a)  We  have  that  A(p,x)  =  p[^f(0>  cp(x) ]  =  p[Vf(x),  cp(x) ] 

+  p[tff(0  -  Vf(x),  <p(x)]  where  t,  lies  on  the  open  line  segment 
joining  x  and  x  -  pcp(x).  Thus  g( p,x)  =  1  +  [tff )  -  Vf(x),  cp(x)  ]/ 
[7f  (x) ,  cp(  x)  ]  •  Since  Vf  is  continuous  at  x  and  ||^  -  x  ||  £p||cp(x) 
g(0,x)  =  1*  If  g(l,x)  <  6,  then  g  being  continuous  takes  on  all 
values  between  1  and  6$  there  exists  therefore  numbers  p  >  0  so 
that  6  £  g(p  fx)  £1-6.  Set  T*f  (x^)  =  cp(x^)  =  and 

= 

=  Pkg(Vpk^fk,cpkl  ^pk6^fk,cpk^  >  °*  111X13  f^xk+l^  <  and 

xk+l  €  ^33ume 


>\]  2  «• 

It  follows  moreover  that  {p^}  is  bounded  away  from  0.  For  if 
not  take  a  thinner  subsequence  £pk),  if  necessary,  such  that 
{pk3  -0.  Since  cpk  is  bounded  on  S,  { 1 1 xR  -  j;  ||}_  0.  By  the 
uniform  continuity  of  V  f  on  S,  it  follows  that 
U>fO;k)  “7f(xk),  cp(xk)]]  -0,  and  therefore  g(Pk,xk)  -1, 
contradicting  that  g(pk,xk)  £  1  -  6  for  all  k*  There  exists 


subsequence 


and  a  number 


such  that 


cp,  ]  />  0,  This  implies  that  there  exists  a 


assume 


Vfk  i  0, 


Thus 
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therefore  a  number  q  >  0  such  that  ^  q.  Hence 

a(x^,p^)  2  qS#,  from  which  we  may  contradict  the  hypothesis  that 

f  is  bounded  below.  Thus  [^f^#^]  -*  0,  showing  (a). 


(o)  Let  z  be  a  cluster  point  of  the  sequence  (x^i*  Th®n 
[Vf ( k) ,cp( z) ]  =  0,  whence  Vf(z)  =  0.  Thus  the  number  of  roots 
of  Vf(z)  is  ^  the  number  of  cluster  points  of  {x^)*  Therefore 
if  Vf  has  a  unique  root  on  S,  then  {x^}  converges  to  it.  If 
the  roots  of  Vf  are  finite  in  number  we  may  suppose  the  cluster 
points  of  (x^)  to  be  thus  also.  Let  x^  be  a  cluster  point  and 
let  x^  be  a  closest  neighboring  cluster  point  to  x^.  Set 
c  =  ||x^-  x2||  and  let  N(x^)  denote  spheres  of  radius  c/3 
centered  at  x^.  H  *v  UN^)  contains  a  finite  number  of  points 
at  most,  say  n.  Since  x^ll  converges  to  0,  we  can 

choose  k  so  that  x^  c  N(x^)  and  ||x^+^-  x^|[  <  e/3m.  But 
v,his  implies  that  H'>/|JN(x^)  contains  more  than  m  points,  which 
is  a  contradiction.  This  contradiction  persists  unless  we  suppose 
that  {x^}  has  a  unique  cluster  point. 

(c)  We  need  the  inequality  f(y)  ^  f(x)  +  [vf(x),y-x]  which 

is  valid  for  all  x  and  y  in  S.  By  the  convexity  of  f, 

[f(x  +  t(y  -  x))  -  f(x)]/t  £  f(y)  -  f (x)  for  t  e  [0,1].  By  the 

differentiability  of  f,  lim[f(x  +  t(y  -  x)  -  f(x)]/t  =  [  Vf  (x)  ,y-x]  • 

t-0 

The  remainder  of  the  proof  is  as  in  [l]. 


Remarks 

1.  The  hypotheses  for  the  first  part  of  (b)  can  be  changed  to: 
Vf  and  <p  continuous  on  S,  and  [cp(x),  f(x)]  0  for  all  xrS, 
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[cp(x),  f(x)]  -  0  only  if  Vf(x)  =  0. 

2.  In  (b)  if  the  sequence  (x^J  has  an  isolated  cluster 
point  then  it  converges* 

3*  The  inequality  in  the  proof  of  (c)  may  sometimes  be  used 
in  (b)  for  terminating  computations.  We  have  f(x^)  >  f(z)  £ 

f(xk)  • 

of  S. 

4.  An  example  for  cp  is  the  following.  Let  H  =  and  let 

6^,  (1  ^  i  ^  n),  denote  the  rows  of  the  identity  matrix.  Choose 

i  so  that  [Vf(x),6^  ]  ^  [vf(x),6^]  Vi,  (l  <£  i  £  n).  Set 

o  ?  - 

cp(x)  =  |j?f(x)||6i,  then  [vf  (x)  ,cp(x)]  £  ||vf(x)  ||  /^n  . 

o 

Corollary.  Let  Q  be  a  uniformly  continuous  non-linear  operator 

on  H  which  is  the  gradient  of  a  convex  "potential"  cp.  Assume  cp 

1 2  2 

satisfies  cp/||x[j  <[p||x||  /2  with  p  <  1*  Then  Q  has  a  constructible 
fixed  point. 

Proof.  Define  f(x)  =  ([x,x]/2d)  -  cp(x)/d,  a  >  0.  Thus 
f(x)  (2d)  1|| x  ||^1  -  p)  and  f  is  bounded  below.  Clearly,  for 
every  set  xq,  the  set  S  =  (x  e  H:f{x)  ^  f(xQ)}  is  closed  convex 
and  bounded,  and  vf  is  uniformly  continuous.  Applying  (c)  we 
construct  a  point  for  which  vf(z)  =0.  Thus  z  is  p  fixed  point  of 
Q.  Q.E.D.  Observe  that  if  H  is  finite  dimensional  we  need  only 
assume  Q  continuous. 

Because  the  method  of  steepest  descent  is  usually  more  laborious 
than  the  algorithm  above,  it  is  not  as  practical.  We  shall  discuss 
it  below,  however,  because  the  proofs  require  slightly  different 


f(\) +  ^ 


1 1<? f^ || D,  where  D  is  the  diameter 
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techniques* 

For  simplicity  we  restrict  our  attention  to  gradient  directions 
and  to  sequences  with  cluster  points. 

Theorem  2  (Steepest  Descent).  Assume  f  continuous  on  S. 

Choose  o  >  0  to  minimize  f(x^  -pTf^));  set  x^+1  =  -  p^vf^), 

and  assume  that  {x^}  has  cluster  points.  Then  (b)  of  Theorem  (l) 
may  be  asserted. 

Proof.  Let  z  be  a  cluster  point  of  xn,  clearly  f(xn)  converges 
downward  to  f(z).  We  shall  show  that  Vf(z)  =  0.  For  suppose  not 
and  let  pQ  >  0  minimize  f(z  -  pvf(z)).  Thus  f(z)  =  f(z  -  pQ7f(z))  +  *1 

for  some  r\  >  0.  But  for  every  x  ,  f(x  -  p  7f(x  )) 

=  f(z-pQ7f(z)  +  (xn  -  z)  +  po(7f(z)  -  f(xn)))  =  f(z  -  (o7f(z)) 

+  [nffcn)»xn  -  z  +  pQ(Vf(z),  -  f(xn))]$  with  £n  in  "between" 

(z  -  pQ7f(z)  and  xn  -  ponf (x^).  Let  {xnJ  be  a  subsequence 
converging  to  z.  Then  Vffc^)  converges  to  ^(z  -  pQ7f(z))  and 
x^  -  z  +  pQ[7f(z)  -  7f(xn)]  converges  to  0.  Hence  for  n  sufficiently 
large  f(xn  -  pQVf(xn))  £  f(z  -  pQVf(z))  +  q/2  =  f(z)  -  rj/, 2.  But 

f(z)  <  f(xn  -  Pn^f(xn))  £  f(xn  -  P0^f(xn))  £  f(z)  -  n/2>  a  contra¬ 

diction;  hence  ^fCz)  =  0. 

A  modified  Steepest  Descent  due  to  H.  Curry  is  more  difficult 
to  prove.  Curry’s  proof  is  geometric,  intricate, and  much  abbreviated. 

We  prove  his  assertion  below. 

Theorem  3  (Curry  [2]).  Assume  7f  continuous  on  S  and  choose 
p^.  to  be  first  zero  of  -~(f(x  -  pvf(x)))  on  the  half-ray 
{x  -  o  f(x):p  ^  0).  Assume  that  the  sequence  {x^}  has  cluster 
points.  Then  (b)  of  Theorem  1  can  be  asserted. 
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Proof*  As  above,  set  Af(x,p)  =  f(x)  -  f(x  -  pVf(x)).  Then 
^  =  -[7f(u),^]  =  [7f  (x  -  p7f(x) ,7 f (x)]  ,  and  pg(x,p)  = 
f(x,p)/||T7f(x)  ||^.  Let  p  be  the  least  positive  root  of 
(pg(x,p))»  =  [Vf(x  -  pVf(x)),Vf(x)]/  ||7f(x)||2.  Thus 

pg(x,p)  =  f*  [7f(_xJ.  -  Pf (x).i7f.L30J  ^  #  Assume  now  Vf(x.  )  does 
0  ||f(x)||2  ^ 

not  converge  to  0*  Then  for  some  cluster  point  z  of  {x^), 

7f(z)  /  0,  For  each  x  consider  the  above  integrand  as  a  function 
of  p.  The  integrand  is  1  at  p  =  0  and  0  at  p  =  p.  We 

shall  choose  pQ  so  that  the  integrand  I(x,p)  is  2  l/2*  Tak* 
a  subsequence  {x^}  which  converges  to  z*  This  sequence  with  its 
limit  point  is  a  compact  subset  of  S,  so  that  7f  is  uniformly 
continuous  on  this  compactun,  which  we  call  K*  There  exists  there¬ 
fore  positive  numbers  q  and  r  such  that  for  all  K 
q  <  ||'M,(xk)  ||  <  r.  By  the  uniform  continuity  of  Vf  on  K,  given 
e  =  q/2  there  exists  6  such  that  ||vf(x  -  pvf(x))  -  vf(x)||  <  q/2  < 
||Vf(x)||/2  whenever  p||Vf(x)||  <  6,  for  all  x  e  K.  Choose 

PQ  =  6/r.  Then  if  p  ^  pQ,  p  |Jvf  (x)  |[  <  6.  Thus  if  p  £  pQ  and 

x  e  K,  ||vf(x  -  p7f(x)  -  7f(x)||  <  ||vf  (x)  ||/2.  Thus 


1  >  IFf("x)]|^f(x"  p7f(x)  £  [ffCx)ll  f  (x  "  p7f  (x)  W 

-  "“f»P  •  tiWt1 »“  proves  that  if  x  «  K 


and  p  £  p  ,  I(x,p)  >  £  and  therefore  p  <  p.  It  follows  that 
o  o 

pg(p)  >  ^po|dp  =  ^pQ  and  thus  uf(xk,pk)  2 
that  f  is  bounded  below  on  K* 


contradicting 
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